Evaluating Deterministic Policies in Two-Player Iterated Games
Abstract
We construct a statistical ensemble of games in which each independent subensemble consists of two players playing the same game. We derive the mean payoffs per move of the representative players and evaluate all deterministic policies with finite memory. In particular, we show that if one of the players follows a generalized tit-for-tat policy, the mean payoffs per move of the two players are forced to be equal. For symmetric, non-cooperative, dilemmatic games, we show that generalized tit-for-tat policies, combined with the condition of not being the first to defect, lead to the highest mean payoffs per move for the players.
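The equalization claim can be illustrated numerically with a minimal sketch. Everything here is an assumption made for illustration rather than the paper's construction: the payoff values (T=5, R=3, P=1, S=0) are a conventional Prisoner's Dilemma choice, classical memory-one tit-for-tat stands in for the "generalized" policy, whose definition the abstract does not give, and the mean payoff per move is approximated over a finite number of rounds. The helper names (`tit_for_tat`, `memory_one`, `mean_payoffs`, `all_memory_one_policies`) are hypothetical.

```python
from itertools import product

# Payoffs (player A, player B) for a symmetric Prisoner's Dilemma.
# The values T=5, R=3, P=1, S=0 are illustrative assumptions.
PAYOFF = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def tit_for_tat(first_move):
    """Classical tit-for-tat: play first_move, then copy the opponent's last move."""
    def policy(own_last, opp_last):
        return first_move if opp_last is None else opp_last
    return policy


def memory_one(first_move, table):
    """Deterministic memory-one policy: table maps (own_last, opp_last) -> next move."""
    def policy(own_last, opp_last):
        return first_move if opp_last is None else table[(own_last, opp_last)]
    return policy


def mean_payoffs(policy_a, policy_b, rounds=10_000):
    """Play the iterated game and return the mean payoff per move of A and B."""
    a_last = b_last = None
    total_a = total_b = 0
    for _ in range(rounds):
        a = policy_a(a_last, b_last)  # A sees (own last move, B's last move)
        b = policy_b(b_last, a_last)  # B sees (own last move, A's last move)
        pay_a, pay_b = PAYOFF[(a, b)]
        total_a += pay_a
        total_b += pay_b
        a_last, b_last = a, b
    return total_a / rounds, total_b / rounds


def all_memory_one_policies():
    """Yield all 32 deterministic memory-one policies (2 first moves x 16 tables)."""
    states = list(product("CD", repeat=2))
    for first in "CD":
        for outputs in product("CD", repeat=4):
            yield memory_one(first, dict(zip(states, outputs)))
```

In this sketch, pairing `tit_for_tat("C")` against each policy from `all_memory_one_policies()` gives total payoffs that differ by at most T - S = 5, because the per-round differences telescope, so the gap between the two mean payoffs per move shrinks as the number of rounds grows. Two tit-for-tat players that both open with cooperation each earn the mutual-cooperation payoff on every move.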
Similar resources
Statistical Mechanics of Two-player Iterated Games
We construct a statistical ensemble of games, where in each independent subensemble we have two players playing the same game. We derive the mean payoffs per move of the representative players of the game, and we evaluate all the deterministic policies with finite memory. In particular, we show that if one of the players has a generalized tit-for-tat policy, the mean payoff per move of both pla...
Search Policies in Multi-Player Games
In this article we investigate how three multi-player search policies, namely maxn, paranoid, and Best-Reply Search, can be embedded in the MCTS framework. The performance of these search policies is tested in four different deterministic multi-player games with perfect information by running self-play experiments. We show that MCTS with the maxn search policy overall performs best. Furthermore...
Exploiting Evolutionary Modeling to Prevail in Iterated Prisoner's Dilemma Tournaments
The iterated prisoner’s dilemma is a famous model of cooperation and conflict in game theory. Its origin can be traced back to the Cold War, and countless strategies for playing it have been proposed so far, either designed by hand or automatically generated by computers. In the 2000s, scholars started focusing on adaptive players, that is, able to classify their opponent’s behavior and adopt a...
Press-Dyson Analysis of Asynchronous, Sequential Prisoner's Dilemma
Two-player games have had a long and fruitful history of applications stretching across the social, biological, and physical sciences. Most applications of two-player games assume synchronous decisions or moves even when the games are iterated. But different strategies may emerge as preferred when the decisions or moves are sequential, or the games are iterated. Zero-determinant strategies deve...
Iterated Regret Minimization in Game Graphs
Iterated regret minimization was recently introduced by J. Y. Halpern and R. Pass for classical strategic games. For many games of interest, this new solution concept provides solutions that are judged more reasonable than those offered by traditional game concepts such as Nash equilibrium. In this paper, we investigate iterated regret minimization for infinite duration two-player qu...
Journal title:
- I. J. Bifurcation and Chaos
Volume 19
Publication date: 2009